Extending the definition of beta-consistent biclustering for feature selection

نویسنده

  • Antonio Mucherino
چکیده

Consistent biclusterings of sets of data are useful for solving feature selection and classification problems. The problem of finding a consistent biclustering can be formulated as a combinatorial optimization problem, and it can be solved by the employment of a recently proposed VNS-based heuristic. In this context, the concept of β-consistent biclustering has been introduced for dealing with noisy data and experimental errors. However, the given definition for β-consistent biclustering is coherent only when sets containing non-negative data are considered. This paper extends the definition of β-consistent biclustering to negative data and shows, through computational experiments, that the employment of the new definition allows to perform better classifications on a well-known test problem.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Heuristic for Feature Selection by Consistent Biclustering

Given a set of data, biclustering aims at finding simultaneous partitions in biclusters of its samples and of the features which are used for representing the samples. Consistent biclusterings allow to obtain correct classifications of the samples from the known classification of the features, and vice versa, and they are very useful for performing supervised classifications. The problem of fin...

متن کامل

Biclustering and Feature Selection Techniques in Bioinformatics

The paper describes several data mining techniques, developed to solve problems which are faced by biologists in Bioinformatics.Several biclustering algorithms which perform clustering on the two dimensions simultaneously are described. Other techniques described in this paper include feature selection methods which help in reducing noise and improving the performance of the classification model.

متن کامل

Applying Feature-Selection Algorithm to Predict Landslide in the Southwest of Iran

Extended abstract 1- INTRODUCTION Nowadays people have an increased sensitivity towards landslides especially in mountainous areas using change in the land use and the expansion of communication networks (Gvrsysky et al., 2006). In the twentieth century, Asia has allocated the highest incident of landslides (220 landslides). Latin America has had the highest number of casualties (more than 2,...

متن کامل

Models and Issues in Consistent Biclustering

Biclustering is a methodology allowing simultaneous partitioning of a set of samples and their features into classes. Samples and features classified together are supposed to have a high relevance to each other which can be observed by intensity of their expressions. The notion of consistency for biclustering is defined using interrelation between centroids of sample and feature classes. Consis...

متن کامل

Feature selection using genetic algorithm for breast cancer diagnosis: experiment on three different datasets

Objective(s): This study addresses feature selection for breast cancer diagnosis. The present process uses a wrapper approach using GA-based on feature selection and PS-classifier. The results of experiment show that the proposed model is comparable to the other models on Wisconsin breast cancer datasets. Materials and Methods: To evaluate effectiveness of proposed feature selection method, we ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011